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;  -*-  MODE:  TEXT;  PACKAGE:  CME;  MUSER:  T;  BASE:  10;  SYNTAX:  ZETALISP;  -*- 

;>  ************************************************************************************ 

;>  (c)  1985  Thinking  Machines  Corporation  of  Cambridge,  Massachusetts. 

;>  All   rights  reserved 

;> 

;>  This  notice  is  intended  as  a  precaution  against  inadvertent  publication 

;>  and  does  not  constitute  an  admission  or  acknowledgement  that  publication 

;>  has  occurred  or  constitute  a  waiver  of  confidentiality.     The  Connection  Machine  Chip 

;>  is  the  proprietary  and  confidential  property  of  Thinking  Machines  Corporation. 

; >  ************************************************************************************ 


Chip  Specification  for  a  Connection  Machine  Chip  (version  Beta) 

{July  1985$ 

-brewster,  danny,  george  and  guy 

This  document  describes  the  functions  of  the  Connection  Machine  Chip 
(version  Beta).     It  describes  everything  a  programmer  and  board 
designer  will  have  to  know  about  the  functions  of  the  chip.     The  exact 
implementation  is  not  described  here.     To  program  the  router  more 
information  is  needed  than  that  contained  in  this  document;  this  will  be 
left  to  some  other  document. 

The  chip  contains  16  processors  and  a  router.     Each  processor  is  a 
one-bit  slice  with  an  ALU,  flag  memory,  and  access  paths  to  the  off— chip 
memory.     All  processors  receive  instructions  from  a  common  stream 
supplied  externally  on  pins  1-0  through  1-16. 

The  chip  provides  two  different  mechanisms  for  communication  among 
processors:     the  NEWS  connections  and  the  router.     The  NEWS  connections 
allow  each  processor  to  communicate  directly  with  its  immediate 
neighbors  in  1,  2,  3  or  4  dimensions.     The  router  is  an  interface  to  a 
packet-switched  communication  network. 

Change  log  for  versions: 
Major  changes  from  version  Alpha  include  the  elimination  of  the  NEWS 
pins,  the  addition  of  error  correction  logic,  a  LOADI  and  a  RUG 
operation,  merging  of  the  Global,   Input  and  Led  pins,  and  improved 
timing.     There  are  also  improvements  in  the  router.  The  chip  is  capable 
of  operation  in  "parity  mode"  for  partial  compatibility  with  version 
Alpha  chips. 


******************  CHANGE  LOG  ****************** 

10/26/85  16:58:49  danny:  Created. 

11/02/85  11:42:54  danny:     Flip  only  happens  on  store  and  rug-r-x 

11/02/85  11:43:47  danny:     Define  what  happens  when  an  error  occures  while 
referencing  an  error  latch  in  the  rug.  The  latch  ignores  the  error. 

11/02/85  13:51:18  danny:     Rug-r-a  and  Rug-r-c  now  load  ALU-SUM  and  COND  latches. 

11/06/85  23:58:03  danny:     Documentation  on  RUG-ACC  and  note  on  accumulation. 

11/07/85  16:05:55  george:  Added  s-match  and  s-match-clr  Snarf  encodings. 

flush  accummulate,  flush  run  modes,  update  flipper 

replace  rug-acc  with  rug-news 


11/14/85  23:03:44  danny: 
11/14/85  23:21:41  danny: 


11/14/85  23:47:38  danny:     merged  in  gls's  news  spec 
but  without  FSEL  and  2  (not  4)  FLIP  registers 

11/15/85  02:22:03  danny:     fixed  wrong  numbers  in  error  table 

11/15/85  15:13:58  george:  fixed  numerous  small  mistakes 
Flushed  i/o  flag  (i/o  rug  bit  remains). 
Updated  chip  latches  table. 
Changed  to  even  parity  everywhere. 
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;;;  Moved  :Signal-Error  field  to  :Error  register. 

;;;  Added  Message-p  register. 

;;;  Flushed  stuff  at  end  i nt o  >cm>beta>un imp  I imented-i deas . text . 

;;;  11/15/85  22:01:16  danny:     updated  which-dims-*  in  tables  an  in  rug 

;;;  11/16/85  16:27:00  danny:     i-par i ty-er ror  is  not  checks  on  reads 

;;;  11/18/85  11:36:49  George:     expanded  error  code  and  flush  signal-error 

;;;  corrected  spelling  error:  which-dims-recieve  — >  which-dims-recei ve 

;;;  11/20/85  00:04:44  danny:     wrote  more  on  signal  error. 

;;;  12/04/85  11:56:44  George:     made  led-cs  drve  consistant  and  fix  c  latch  spec 

;;;  1/06/86  12:35:36  danny:  The  Flipper  does  permute  the  data 

;;;  written  to  memory  on  rug-r-a  and  rug-r-c  cycles. 

;;;  1/06/86  12:35:36  danny:  added  ecc5  input 

;;;  1/07/86  10:25:54  danny:     documented  PMODE  bit  of  instruction 

;;;  1/07/86  11:03:39  danny:     updateing  timing  diagram 

;;;  1/14/86  09:23:09  george:   added  LOADI  RUG-R-A  sequence 

;;;  3/05/86  12:34:43  danny:     documented  format  of  rug  dimension-n 

;  ;  ;  ***************  END  OF  CHANGE  LOG  *************** 
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PROCESSOR  DATA  PATHS 

This  section  describes  the  major  latches  and  data  paths.     The  state 
bits  of  the  machines  are  divided  among  "latches,"  "registers,"  and 
"Flags."  The  flags  are  temporary  state  bits  associated  with  each 
processor,  and  are  explicitly  written  during  the  Store  Cycle  of  an 
instruction  sequence.     The  latches  are  used  for  special  purposes  and  are 
implicity  written  during  the  execution  of  various  instructions  (see 
table  xxx).     The  registers  hold  status  information  associated  with  the 
entire  chip  (see  Table  xxx).     The  a  bank  of  status  registers  is  called 
the  Rug  (see  RUG  section),  and  it  is  written  explictly  on  a  RUG-W-A  or 
RUG-W-C  Cycle. 

Each  processor  is  intended  to  be  associated  with  an  external  memory. 

In  a  typical  bit  operation  (see  the  Sequences  Section)  each 
processor's  ALU  takes  five  single  bit  inputs.     Two  of  these  inputs  (A 
and  B)  come  from  the  external  memory.     The  third  comes  from  one  of  the 
processor's  internal   flags  (called  F) .     There  are  two  special    inputs  to 
the  ALU  called  the  COND  input,  which  is  loaded  from  a  flag,  and  the  C 
input  which  is  usually  loaded  with  the  same  thing  as  A.     Each  processor 
has  two  outputs:     Sum  (written  to  the  external  memory)  and  Carry  (written 
to  a  flag).     The  Cond  flag  determines  whether  the  sum  and  carry  are  written 
with  the  newly  computed  values  or  rewritten  with  their  previous  values. 

ALU 

The  ALU  on  the  chip  is  composed  of  16  single-bit  ALUs,  one  for  each 
processor  on  the  chip.     Each  one-bit  ALU  does  the  same  operation  on  the 
bits  supplied  by  the  input   latches.     Each  processor  has  different  data. 

A  normal  operation  takes  place  in  several  cycles.     During  the  load  cycles 
of  an  instruction  cycle  (LoadA  and  LoadB)  each  processor's  ALU  input 
latches  are  read  from  a  flag  into  F,  another  flag  into  COND  (can  be 
inverted  by  an  instruction),  and  one  bit  from  each  of  any  two  memory 
locations  into  A  and  B.     The  ALU  produces  its  two  results,  Carry  and  Sum, 
on  the  store  cycle.     The  store  cycle  specifies  what  flag  to  update  with 
the  carry  and  what  memory  location  to  update  with  the  Sum.     Usually  the 
memory  location  for  writing  is  the  same  as  was  read  to  load  the  A  latch. 
During  the  store  cycle,   if  a  processor's  COND  latch  contains  1,  the  Sum 
output   is  written  back  into  memory  and  the  Carry  goes  to  a  flag.     If  the 
COND  latch  contains  0,  then  the  ALU  writes  the  contents  of  the  C  latch 
back  to  memory,  and  does  not  update  the  write  flag.     Therefore,  nothing 
changes . 

The  SUM  output  of  the  processors  passes  through  a  permutation  device 
called  the  flipper,  that  determines  which  output  goes  to  which  bit  of 
memory.  This  is  described  in  flipper  section. 

The  ALU  can  compute  any  boolean  function  of  its  three  inputs  for  each 
of  its  two  outputs.  These  two  functions  are  specified  by  two  8-bit  truth 
tables.     The  truth  table  for  the  carry  is  specified  by  the  ALU-CARRY 
field  of  the  LOADA  instruction.     The  truth  table  for  the  sum  is 
specified  by  the  ALU-SUM  field  of  the  LOADB  instruction.  The  truth 
tables  are  constructed  so  they  will  produce  the  correct  value 
when  indexed  as  follows:     The  a-input  becomes  the  high  order  bit  of  a 
three  bit   index,  the  b-input  becomes  the  middle  bit  and  the  flag-input 
becomes  the  low  order  bit  in  the  index.     The  selection  of  these  bits  is 
determined  as  shown  in  Table  1.     For  example,  the  truth  table  for  the 
function  that  computes  the  logical  AND  of  A  and  B  would  be  11000000 
(binary) . 
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Table  I. 


A  T  nnnt 

r\     X  1 1  p  U  L 

lj  input 

P.nrr v— fti it  m  1 1 

Q 1 1  m — O  1 1 1  nut 

0 

0 

0 

ALU-CARRY-0 

ALU-SUM-0 

0 

e 

1 

ALU-CARRY-1 

ALU-SUM-1 

0 

1 

0 

ALU-CARRY-2 

ALU-SUM-2 

0 

1 

1 

ALU-CARRY-3 

ALU-SUM-3 

1 

0 

0 

ALU-CARRY-4 

ALU-SUM-4 

1 

0 

1 

ALU-CARRY-5 

ALU-SUM-5 

1 

1 

0 

ALU-CARRY-6 

ALU-SUM-6 

1 

1 

1 

ALU-CARRY-7 

ALU-SUM-7 

There  are  also  several    latches  in  the  machine  that  are  used 
for  special  purposes.  Data  from  the  router  is  stored  in  the 
RBO  latch  before  it  goes  to  memory  for  buffering.     Data  from 
memory  is  stored  first  in  the  RBP  latch  and  then  in  the  RBI 
latch  as   it  goes  to  the  router. 


Processor  Flags 

Each  processor  has  associated  with  it  4  general   read/write  flags  and  4 
additional  special  purpose  flags.     The  general   flags  are  used  to  store 
single— bit  temporary  values,  such  as  the  carry  during  a  serial  addition 
ope  rat  i  on . 


Addr 

Name 

Write? 

Comments 

0 

Flag0 

yes 

1 

Flagl 

yes 

2 

Flag2 

yes 

3 

Flag3 

yes 

4 

no 

Rese  rved 

5 

no 

Rese  rved 

6 

Com-E 

Yes 

Writing  this  will  strobe  what  was 

read  in  on  the 

last  LoadB  cycle  into  the  request 

latch  in  the 

router.     Data  read  from  it   is  the 

grant  output 

of  the  router. 

7 

zero 

Yes 

Always  zero.  Writing  to  Zero  has 

no  effect . 

The  effect  of  writing  non-writable  flags  is  undefined,  and  reserved  for 
future  expansion. 
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Chip  Latches: 


This  section 
discussed  in 


is  a  quick  reference  section, 
the  rest  of  this  document. 


These  latches  and  busses  are 


Control  of  the  ALU  Latches  is  as  follows: 


Name 
A 


F 

Cond 


A I  u-sum 

A I  u-Carry 
Cube-I n 


I/O 

Edge- I atch 
Cube-se I 

CS 


Bus  i  n 
I 


rug 


M  16 

Cube-In  16 
Route  bus  16 
M 

Rug 

FLAG 

(xor 
Flag 


Number    Loading  Clock 
16  (loadl) 

16  ((or  loadA 

rug-w-c) ) 
16  (rug-r-a) 

(LoadB) 

uand  I  oadA  BSe  1=1 
( (and  loadA  BSe 1=1 
16  ((or  loadA 

loadl 
rug-w-a) ) 
( rug-r-c) 

16  (loadA) 


I 


8 


Constant8 
I  8 
M  16 
cube  pins  16? 


I/O  pin  1 
I<7>  1 
See  store  op 
4 

I ed-CS-pi  n 
1 


i  nv) 

16 

( loadB) 

CS 

16 

(loadl) 

1s 

16 

( rug-r-c 

0s 

16 

( rug-r-a 

M 

16 

((or 

Comments 

; ;The  A  Latch  i  s  the 
; ;  A  i nput  to  the  ALU 

;On  rug-w-c, 

;  the  A  latch  is  written  by  memory. 
;0n  rug-r-a  the  A  latch  is  written  by  the  rug. 
. ;B  input  to  ALU 
Wh  i  ch-Dims-Send19) 
Which-Dims-Send=19) 

;For  rewriting  to  memory  if 
it   is  a  conditional  operation 
AND  the  condition  failed. 
;It  is  also  used  for  accessi 

the  rug. 
Flag  i  nput  to  ALU .  One 

for  each  processor. 
Input  to  ALU: 
Cond i t i ona I i zat i on  bits 
;The  INV  signal   is  in 

the  instruction. 
;This   is  for  direct  writing 


i  ng 


LOADB 

Rug-News) ) 
( loadb) 

((or  rug-r  loadi)) 
(LoadA) 

(or  Rug-News  rug-cube) 

(And  LatchR 

(not  rug-news) 
(not  route)) 

(LoadA) 

(Store) 

(store) 

(or  loadi 
read 
rug) 


;Data  from  processors  to  router 


ALU  control  for  sum  output. 
This  is  for  writing  memory 
ALU  control   for  carry  output 


;used  in  the  news  circuit 

;used  in  the  news  circuit 
;This  is  the  Chip  Select  line. 


Other  Latches  not   in  the  Rug. 

(the  ones  in  the  rug  have  been  described 


in  the  rug  section) 


; EMPTYP 

1 

Router 

;RBO 

8 

Route  r 

;RBI 

8 

Router 

;RBP 

8 

Router 

Router  Empty  bit 

Router  data  going  to  memory  for  buffering 
Router  data  going  to  router  from  buffering 


rcouter  aata  going  to  router  from  buffering 
Router  data  coming  from  memory  from  buffering 
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The  Basic  Cycles 

Each  time  a  pulse  is  applied  to  the  Clock  pin,  a  cycle  is  executed  (see 
clocking  section).  The  OP  Pins  and  the  Instruction  Pins 
determine  which  type  of  cycle  is  executed.     This  section  describes  the 
different  OP  cycles  and  their  effect  on  the  state  of  the  chip  and  the 
dr i v i ng  of  the  pins. 

There  are  eight  possible  cycles,  as  shown  in  the  table  below.     The  RUG 
cycle  is  further  decoded  into  six  different  operations  RUG-W-A, 
RUG-W-C,  RUG-R-A,  RUG-R-C,  RUG-NEWS,  RUG-CUBE. 


OP 

0 

1 

2 

3 

4 

5 

6 

7 

NAME 

NOP 

LOADA 

LOADB 

STORE 

READ 

LOAD  I 

ROUTE 

RUG 

10 

DC 

FLAGR0 

COND0 

FLAGW0 

Z 

DATA0 

CYCLE0 

R/W 

1 1 

DC 

FLAGR1 

COND1 

FLAGW1 

z 

DATA1 

CYCLE1 

A/C 

12 

DC 

FLAGR2 

COND2 

FLAGW2 

z 

DATA2 

CYCLE2 

NEWS 

13 

DC 

BSEL 

INV 

MB0 

z 

DATA3 

CYCLE3 

MB0 

14 

DC 

ALUC0 

ALUS0 

MB0 

z 

DATA4 

CYCLE4 

REG0 

15 

DC 

ALUC1 

ALUS1 

MB0 

z 

DATA5 

CYCLE5 

REG1 

16 

DC 

ALUC2 

ALUS2 

MB0 

z 

DATA6 

CHECK0 

REG2 

17 

DC 

ALUC3 

ALUS3 

EDGE 

z 

DATA7 

CHECK 1 

REG3 

18 

DC 

ALUC4 

ALUS4 

CUBE0 

z 

DATA8 

XOR1 

REG4 

19 

DC 

ALUC5 

ALUS5 

CUBE1 

z 

DATA9 

XOR2 

MB0 

110 

DC 

A  LUC  6 

ALUS6 

CUBE2 

z 

DATA 10 

XOR3 

MB0 

111 

DC 

A  LUC  7 

ALUS7 

CUBE3 

z 

DATA1 1 

X0R4 

MB0 

112 

DC 

MB0 

MB0 

MB0 

z 

DATA 12 

SNARF0 

MB0 

113 

DC 

MB0 

MB0 

MB0 

z 

DATA 13 

SNARF1 

MB0 

114 

DC 

MB0 

MB0 

MB0 

z 

DATA 14 

SNARF2 

MB0 

115 

DC 

PMODE 

PMODE 

PMODE 

z 

DATA 15 

ODD 

PMODE 

116 

DC 

PARITY 

PARITY 

PARITY 

z 

PARITY 

PARITY 

PARITY 

DC 

means  Don '  t- 

-care.  I 

MB0  means 

must  be 

0.  z 

means  high  impedence. 

On  each  cycle,  except  NOP,  the  116  is  used  for  an  even  parity  bit  (if 
the  data  is  all  zero,  the  parity  bit  will  be  zero)  computed  across  10 
through  115.    We  use  even  parity  in  order  to  catch  the  all  ones  error, 
which  is  more  likely  in  this  system.     A  typical    instruction  sequence 
consists  of  a  sequence  of  several  cycles.     These  are  documented  in  the 
Sequences  section. 

Sendr  and  Latchr  are  two  pins  to  the  chip  that  are  independent  of  the 
other  cycles,  except  for  Route,  Rug-Cube  and  Rug-News  cycles.     At  any 
other  time  Latchr  can  be  asserted  and  will  change  some  state  of  the 
chip.     If  Latchr  is  asserted  during  a  Route,  Rug-Cube,  or  Rug-News 
cycle,  the  results  are  not  defined. 

CYCLE  DEFINITIONS: 

This  section  describes  what  state  changes  for  each  of  the  different  op  codes. 
This  also  lists  what  pins  are  used  and  unused. 

NOP  CYCLE  (OP  =  0) 

The  chip  does  not  change  state,  (unless  latchr  is  asserted).  No  errors  conditions 
are  checked. 

The  I/O,  Memory,  Error,  and  Instruction  pins  are  undriven. 
LOADA  CYCLE  (OP  =  1) 

The  ALU-CARRY  Latch  is  loaded  from  the  ALUC  field.     The  F  Latch  is  loaded 
from  the  flag  specified  by  the  FLAGR  field.     The  A  Latch  and  C  latch  are 
loaded  from  the  Memory  Pins.     The  I/O  latch  is  loaded  from  the  I/O  pin. 
If  BSEL  is  active,  then  the  B  latch  is  loaded  from  one  of  two  busses. 
If  Wh i ch-D ims-Send  =  19  then  the  B  latch  is  loaded  from  the  route  bus 
otherwise  it  is  loaded  from  the  Cube-In  latches. 

If  the  PMODE  bit  of  the  instruction  or  the  Parity-p  bit  of  the  rug  is 
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active,  then  the  cycle  is  executed  in  Parity  Mode,  otherwise  the  cycle 
is  execute  in  ecc  mode. 

The  I/O,  LED-CS,  Meniory,  and  Instruction  pins  are  undriven.  Error  is 
driven  if  there  is  an  error. 

LOADB  CYCLE  (OP  =  2) 

The  ALU-SUM  Latch  is  loaded  from  the  ALUS  field.     The  COND  Latch  is  loaded 
from  the  XOR  of  the  one-bit  INV  field  and  the  flag  specified  by  the  COND 
field.     The  B  Latch  is  loaded  from  the  Memory  Pins.     The  R  Latch  is 
loaded  from  the  Memory  Pins. 

If  the  PMODE  bit  of  the  instruction  or  the  Parity-p  bit  of  the  rug  is 
active,  then  the  cycle  is  executed  in  Parity  Mode,  otherwise  the  cycle 
is  execute  in  ecc  mode. 

The  I/O,  Memory,  LED-CS  and  Instruction  pins  are  undriven.     Error  is  driven  if 
there  is  an  error. 

STORE  CYCLE  (OP  =  3) 

The  behavior  of  the  STORE  cycle  depends  on  the  state  of  the  COND 
latches.     For  each  processor,   if  the  COND  latch  holds  a  one,  then  the 
flag  specified  by  the  FLAGW  field  is  written  by  the  Carry  output  of 
the  ALU  and  the  Memory  Pin  for  that  processor  is  driven  from  the 
output  of  the  flipper  for  that  processor.     If  the 

COND  latch  for  a  processor  is  zero,  then  no  flag  is  written  and  the 
Memory  Pin  for  that  processor  is  driven  from  the  corresponding  C  latch. 

If  the  PMODE  bit  of  the  instruction  or  the  Parity-p  bit  of  the  rug  is 
active,  then  the  cycle  is  executed  in  Parity  Mode,  otherwise  the  cycle 
is  execute  in  ecc  mode. 

Other  actions  of  the  STORE  cycle  are  independent  of  the  state  of  the 
COND  latch. 

The  flipper  output   is  derived  by  permuting  the  Sum  output  from  the  ALU 
according  to  the  FLIP0  and  FLIP1   registers  in  the  RUG.     The  NOR  of  the 
carry  outputs  from  the  ALUs  is  latched  into  the  Global-latch  latch.  The 
EDGE  Latch  is  loaded  from  the  EDGE  field.     The  Cube-Sel  Latch  is  loaded 
with  the  XOR  of  the  CUBE  field  in  the  instruction  and  the  CUBE-SEL-XOR 
latches  in  the  RUG.     The  output  of  the  router  is  loaded  into  the  RBO  latch 
and  the  one-bit  EMPTYP  latch.     The  RBP  latch  is  loaded  into  the  RBI  latch. 

The  Instruction  pins  are  undriven.     The  LED-CS  Pin  is  driven  from  the 
Global  signal.     The  I/O  Pin  is  driven  with  the  output  of  the  Global-latch 
latch.     The  Global  Signal   (used  on  the  I/O  pin)   is  the  output  of  the 
Global— latch  latch.     Error  is  driven  if  there  is  an  error. 

READ  CYCLE  (OP  =  4) 

The  chip  does  not  change  state  on  this  cycle,  unless  there  is  an  error 
while  the  chip  is  selected. 

If  the  CS  pin  is  asserted,  the  Instruction  pins  are  driven  with  the 
unpermuted  data  read  from  the  M-PINS,  otherwise  they  are  undriven.  The 
I/O,  Memory,  and  CS-LED  pins  are  undriven.     The  Error  Pin  is  driven  if  there  is 
an  error  and  the  chip  has  cs  enabled.     CS  latch  is  latched. 

If  the  Parity-p  bit  of  the  rug  is  active,  then  the  cycle  is  executed 
in  Parity  Mode,  otherwise  the  cycle  is  execute  in  ecc  mode. 

LOADI  CYCLE  (OP  =  5) 

The  C  latch  is  loaded  from  the  Memory  Pins.     The  A  latch  is 
loaded  from  the  Instruction  Pins.     The  ALU-Sum  latch  is 
loaded  with  a  constant  that  will   force  the  ALU  sum  output  to 
produce  the  A  input.     The  COND  Latch  is  loaded  with  ones  if 
the  LED-CS  pin  is  asserted,  otherwise  with  zeros.  CS  latch  is 
I atched . 
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If  the  Parity-p  bit  of  the  rug  is  active,  then  the  cycle  is  executed 
in  Parity  Mode,  otherwise  the  cycle  is  execute  in  ecc  mode. 

The  I/O,  Memory,  CS-Led  and  Instruction  pins  are  undriven.     Error  is 
driven  if  there  is  an  error. 

ROUTE  CYCLE  (OP  =  6) 

The  router  is  advanced  one  step,  according  to  the  instruction  bits,  as 
described  in  the  Router  section.  The  input  from 

the  router  is  read  from  the  R  latch,  the  RBI   latch  and  the  Cube  latch 
The  output  of  the  router  changes  but  is  not  loaded  into  the  RBO  latch. 

The  RBP  latch  is  loaded  from  Memory  pins  8  through  15  (if  the  ODDP  field 
is  one)  or  Memory  pins  0  through  7  (if  the  ODDP  field  is  zero).  Memory 
pins  0  through  7  (if  the  ODDP  field  is  one)  or  Memory  pins  8  through  15 
(if  the  ODDP  field  is  zero)  are  driven  from  the  RBO  latch. 

This  cycle  is  always  executed  in  Parity  Mode. 

The  I/O  and  LED-CS  pins  are  driven  from  the  OR  of  the  MSG-P  latches. 
The  Instruction  pins  are  undriven.     Error  is  driven  if  there  is  an  error 


RUG  CYCLE  (OP  =  7) 

The  Rug  instruction  is  used  for  reading  or  writing  the  rug. 

If  the  PMODE  bit  of  the  instruction  or  the  Parity-p  bit  of  the  rug  is 
active,  then  the  cycle  is  executed  in  Parity  Mode,  otherwise  the  cycle 
is  execute  in  ecc  mode.  CS  latch  is  latched. 

One  of  five  things  happens  on  this  cycle,  depending  on  the  R/W  and  A/C 
bits: 

RUG-W-A  (R/W=0  A/C=1  NEWS=0) 

The  rug  register  specified  by  the  Reg  field  is  written  by  the  A  latch 
The  memory-pins  are  written  to  the  C  latch. 

RUG-W-C  (R/W=0  A/C=0  NEWS=0) 

The  rug  register  specified  by  the  Reg  field  is  written  by  the  C  latch 
The  memory-pins  are  written  to  the  A  latch. 

RUG-R-A  (R/W=1  A/C=1  NEWS=0) 

The  rug  register  specified  by  the  Reg  field  is  read  into  the  A  latch.  The 
C  latch,  permuted  by  the  flipper,  drive  the  memory-pins.     ALU-SUM  Latch  is 
loaded  with  :A,  so  that  the  Sum  output  of  the  alu  will   read  the  A  Latch 
(this  is  needed  for  rug-r-c,  but  is  also  here  anyway).     The  COND  Latch  is 
I oaded  wi th  a  zero. 

it   looks  like  we  dont  need  to  disturb  the  alu-sum  or  cond  latch  here  because 
we  hop  over  the  ALU  anyway  in  the  implementation,  -brewster  and  Dave 
Douglas  8/25/86 

RUG-R-C  (R/W=1  A/C=0  NEWS=0) 

The  rug  register  specified  by  the  Reg  field  is  read  into  the 
C  latch.     The  memory-pins,  permuted  by  the  flipper,  are 
written  by  the  A  latch.  The  ALU-SUM  Latch  is  loaded  with  :A, 
so  that  the  Sum  output  of  the  alu  wi I  I   read  the  A  Latch.  The 
COND  Latch  is  loaded  with  a  one. 
;it   looks  like  we  dont  need  to  disturb  the  alu-sum  or  cond  latch  here  because 
;we  hop  over  the  ALU  anyway  in  the  implementation,  -brewster  and  Dave 
; Doug  I  as  8/25/86 

RUG-NEWS  (R/W=1  A/C=0  NEWS=1 ) 


The  R  latch  and  the  Cube-In  latches  are  loaded  from  Memory.     Latches  A,  B, 
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C  are  preserved. 

RUG-CUBE  (R/W=1  A/C=1  NEWS=1 ) 

The  Cube-In  latches  are  loaded  from  Memory.     Latches  A,  B, 
C  and  R  are  preserved. 

Other  combinations  of  R/W  A/C  and  NEWS  are  reserved. 

Instruction,  global  are  undriven.     Based  on  r/w  the  memory  is  driven  or 
undriven.     Error  is  driven  if  there  is  an  error. 


fl:  )crn)defs)f2BB)ch  ip-spec  if  icq t  ion.  I  isp.139     1/19/87  19:55:52     page  IB 


SEQUENCES  SECTION 

An  operation  on  a  bit  in  memory  takes  more  than  one  cycle.     The  op  codes 
are  grouped  in  sequences  for  different  purposes.     Due  to  the  fine 
granularity  of  the  instruction  set  for  this  chip,  a  great  deal  of 
flexibility  is  gained.     There  are  many  other  potential  sequences  than  the 
ones  listed  here.     These  are  meant  to  be  the  common  ones  and  will 
suffice  for  most  code.    All  chip  features  are  doable  with  these  sequences. 
If  a  programmer  needs  to  try  to  bum  cycles  then  other  sequences  might  be 
used.  The  common  sequences  are: 


LOADB,  LOADA,  STORE 

LOAD I,  STORE 

LOADB,  LOADA,  RUG-W-A,  STORE 

LOADB,  LOADA,  STORE,  ROUTE 
READ 

RUG-W-A,  RUG-W-C 

RUG-R-A,  RUG-R-C 

LOAD  I,  RUG-R-C 

LOAD  I,  RUG-R-A 

LOAD  I,  RUG-W-A 

LOADA,  LOAD I,   LOADB,  STORE 

STORE,  STORE,  STORE,  STORE 

STORE,  STORE,  RUG-R-A,  RUG-R-C 

LOADB,  (LOADA  STORE) * 

LOAD  I,  (RUG-R-C)* 

LOADB,  (LOADA,  STORE)* 

LOADA,  (LOADB,  STORE)* 


A  two  address  arithmetic  operation 
A  direct  memory  write  to  selected  chip 
A  three  address  arithmetic  operation 
A  communications  step 
A  direct  memory  read 
A  move  from  memory  to  rug  register 
A  move  from  rug  register  to  memory 
A  direct  memory  write  to  all  chips 


memory 

rug  register 

ope  rat  i  on 

sequence 


Copy  memory  and/or  flip 
An  immediate  write  to  a 
An  immediate  arithmetic 
A  2-D  NEWS  transmission 
D  i  rect  Cube  Wr  i  t  i  ng 
Single  instruction  arithmetic 
Clearing  memory 

Unconditional   two-address  1  operand  (lognot  or  move) 
Uncond  two-address  1  operand  with  changing 
alu-sum  instruction  ( I ogxor-immed i ate) . 


LOADB  LOADA  STORE: 
This  is  the  normal  sequence  for  conditional   two-address  arithmetic 
or  unconditional  three-address  arithmetic.     If  the  operation  is 
conditional,  then  the  memory  address  in  the  LoadA  cycle  must  be 
the  same  as  the  store  cycle.     If  the  operation  is  unconditional 
(condition-flag:  zero,  cond-invert:  1)  then  this  sequence  can  be  used  as 
a  three-address  instruction  where  the  store  cycle  has  the 
destination  memory  address. 

LOAD I  STORE 

This  is  the  normal  sequence  for  direct  memory  write  to  a  chip  selected  by  CS. 
In  other  words  the  data  to  be  written  in  the  selected  chip  is  put  on  the 
instruction  lines.     All  chips  load  the  instruction  bus  into  their  A  latch  and  a 
memory  location  into  the  C  latch  on  the  Loadl.     Also,  the  condition  is  setup  on 
the  selected  chip  (latched  from  the  CS  pin  on  the  Loadl)  so  that  the  A  latch  is 
brought  to  the  SUM  output  of  the  ALU,  and  on  all  other  chips  the  C  latch  is 
brought  to  the  SUM  output.     Thus  on  the  store,  either  the  C  and  A  latch  to 
memory . 

LOADB  LOADA  RUG-W-A  STORE 
This  is  the  normal  sequence  for  conditional   three-address  arithmetic. 
The  Rug-W-A  and  the  Store  must  have  the  same  addresses,  and  the  Rug-W-A  will 
load  the  C  latch  from  memory.  The  Rug  Register  will,  typically,  be  Sink. 

LOADB  LOADA  STORE  ROUTE 
This  is  the  normal  sequence  for  routing.     The  memory  address  in  the 
LoadB  is  sent  to  the  router.     The  LoadA  has  its  BSEL=ROUTER. 
The  router  data  comes  in  the  B  latch  to  the  ALU. 

READ 

This  is  the  normal  sequence  for  direct  memory  read. 
RUG-W-A  RUG-W-C 

This  is  the  normal  sequence  for  writing  to  a  Rug  register  from  memory. 
The  RUG-W-A  loads  the  C  latch  from  memory  so  the  RUG-W-C  can  write  the 
data  into  specified  register.    Writes  to  multiple  rug  registers  can  be 
pipelined  by  alternately  latching  A  and  C  while  writing  the  rug  from  C 
and  A.  This  can  be  used  to  write  N  rug  registers  in  N+1  cycles. 


RUG-R-A  RUG-R-C 
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This  is  the  normal  sequence  for  reading  from  a  rug  register  to  memory. 
The  RUG-R-A  loads  the  C  latch  from  the  rug  so  the  RUG-R-C  can  write  the 
data  into  memory.     Reads  from  multiple  rug  registers  can  be  pipelined  by 
alternately  latching  A  and  C  while  writing  the  memory  from  C  and  A.  This 
can  be  used  to  read  N  rug  registers  in  N+1  cycles. 


LOAD I  RUG-R-C 
This  is  the  norma 


I  sequence  for  direct  memory  write  to  all  chips. 


LOAD I  RUG-R-A 

This  sequence  can  be  used  to  copy  memory  from  one  address  to  another  on 
all  chips.     It  can  be  used  to  flip  memory,  since  the  LOADI  sets  the  C 
latch  with  the  unf lipped  source  and  the  RUG-R-A  writes  the  C  latch 
through  the  flipper  to  the  destination. 

LOADI  RUG-W-A 

This  is  the  normal  sequence  for  immediate  write  to  a  Rug  register  in 
all  chips.     The  LoadI   loads  the  A  latch  so  the  Rug  can  accept  the  A  for 
the  Rug  Write  operation.     The  C  latch  is  loaded  from  the  memory  location 
in  the  Rug  cycle.  Additional   rug  registers  may  be  loaded  with  the 
same  constant  with  additional  RUG-W-A  cycles. 

LOADA  LOADI  LOADB  STORE 
This  is  the  normal  sequence  for  conditional   two-address  or 
unconditional  three-address  immediate  arithmetic,  with  potentially 
different   immediate  values  for  each  processor  on  a  chip.  The 
BSel=Router  cannot  be  used  in  this  case.     The  address 
specified  by  the  I oadA  does  not  matter.     The  LoadI  address  should  have 
the  same  address  of  the  store  if  the  operation  is  two-address  conditional 


Why  this  instruction  works: 


A 
B 
C 
F 

Cond 
A I u-Sum 
A I u-Ca  r  ry 


LoadA 
memory 

memory 
f  lag 


i  nst  r 


LoadI 
I  nst  r 

memory 

CS 

mashed 


LoadB 
memory 


f  lag 
Inst  r 


Store 

i  nst  r(Loadl) 
memory ( I oadB' 
memory ( I oadl 
f  I  ag( I oada) 
f lag(loadb) 
Inst  r( I oadb) 
Inst  r( I  oada) 


What  is  the  sequence  for  unconditional  two-address  immediate  arithmetic?  -brewster  and 


KB0 


STORE  STORE  STORE  STORE 

This  is  a  normal  2-D  NEWS  transmission 
have  6  store  cycles,  a  4-D  would  have  8 
sequence  the  Cubel  field  is  incremented 
sequence,  each  processor  on  the  externa 
sequences  can 
cycle.  Sendr 


2.  NEWS  transmission 
sequence  to  save  one 


store  cycle,  ♦♦♦♦more 
news  document  for  how 


sequence.  The  3-D  transmission  would 

and  1-D  would  only  have  1.  During  each 
and  the  CUBEO  field  specified,  in 
boundary  of  the  chip  as  shown  in  Table 
be  overlapped  with  a  load-load-store 
and  Latchr  must  be  strobed  on  or  after  each 


on  sendr  latchr.  This  section  is  no  good.  Please  see  the 
to  do  news.     I  hope  that  is  better,  -brewste r***» 


STORE  STORE  RUG-R-A  RUG-R-C 


[*♦♦*  looks  wrong  to  me****  — danny] 

This  is  a  normal  sequence  for  direct  cube  writing.     This  sequence  can  be 
overlapped  with  a  load-load-store  sequence  to  save  one  cycle.     Sendr  and  Latchr 
must  be  strobed  on  or  after  each  store  cycle.     The  data  is  read  from  the 
cube-in  latch  on  the  rug-r-a  cycle  and  written  to  memory  on  the  rug-r-c 
cycle.     ♦**more  on  sendr  latchr 

LOADB  (LOADA  STORE) ♦ 
This  is  a  normal  sequence  for  single  instruction  arithmetic.     The  LoadB 
loads  the  context  and  ALU  Sum  function.     This  only  need  happen  once  for 
the  whole  field  since  a  second  input  to  the  ALU  is  not  needed  for  single 
instruction  arithmetic.     Such  single  instruction  arithmetic  is 
complement,  or  complement  based  on  another  bit  (loaded  during  the 
LoadB). 

LOADI  (RUG-R-C)* 

This  is  a  normal  sequence  for  unconditional  clearing  of  memory.  The 


fi:)cm)defs)f2B0)chip-specif ication.l  isp.139     1/19/87  19:55:52     page  12 


LoadI   is  used  to  set  up  the  function  and  data  and  the  Rug-R-C's  write  the 
same  data  everywhere.     This  can  be  used  for  setting  memory  to  an  arbitrary 
constant . 

LOADB  (LOAD A  STORE)* 

This  is  the  normal  sequence  for  unconditional  two-address  instruction  that  have  only 
one  operand.     The  only  instructions  that  have  this  property  are  lognot  (invert  a  field) 
and  move.     Move  is  accomplished  faster  by  LoadI  Rug-W-A. 

LOADA,   (LOADB,  STORE)* 
Uncond  two-address  1  operand  with  changing  alu-sum  instruction.  The 
only  instruction  that  needs  this  is  logxor-immediate.     This  is  done  by 
the  instruction  ALU  field  is  in  the  loadB  instruction.     This  can  be 
instructed  to  toggle  the  B  input  to  the  ALU  or  not  based  on  a  bit  in  the 
microcontroller.     This  is  incredibly  obscure  and  only  included  for  hack 
va I ue . 
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Error  Signal i ng 

Whenever  any  error  occurs,  the  syndrome  register  is  written  with  the  memory 
error  syndrome  (the  XOR  of  the  six  stored  ECC  bits  and  the  six  computed  ECC 
bits)  and  the  error-code  register  is  written  with  a  number  indicating  the 
typeof  error  as  shown  in  table  IX.     The  error-code  and  syndrome  registers 
remain  set  until  written  again,  either  explictly  or  as  a  consequence  of 
another  error. 

The  error  pin  is  driven  whenever  an  error  is  signalled  and  the  Enable-error 
register  is  enabled.     All  errors  are  signalable  if  the  SIGNAL-ALL-ERRORS 
register  is  high.  All  errors  except  single  bit  memory  errors  in  ecc  mode 
(error  code  4)  are  signalable  if  the  SIGNAL-ALL-ERRORS  register  is  low. 


Note:   If  an  error  occurs  while  reading  or  writing  a  rug  register  that 
is  automatically  changed  by  the  error  system,  the  automatic  change  will 
not  take  place  in  that  register.  This  is  prevents  a  register  from 
changing  while  it  is  being  referenced,  which  could  lead  to  unpredictable 
resul ts. 


Table  IX 


COMMENT 

"Message  Parity  Error") 

"Instruction  Parity  Error") 

"Memory  Parity  Error  (in  parity  mode)") 
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RUG  SECTION 

The  RUG  stands  for  Chip  Status  Register  (RUG).     It  holds  many  of  the 
bits  in  the  chip  that  are  not  directly  associated  with  processors.  This 
means  chip  oriented  state  such  as  error  status,   router  modes,  flip 
registers,  etc.     These  bits  are  clumped  into  registers  for  fast  access. 
They  can  be  read  and  written  by  using  the  RUG  instruction  (see 
instruction  section). 

There  is  a  pipeline  that  allows  a  fast  read  and  write  rate  from 
memory.    On  a  rug  instruction  the  a-c  bit  allows  the  rug  to  be  read  or 
written  (based  on  another  bit)  to  come  from  the  A  or  the  C  latch.  The 
latch  that  is  not  going  to  (or  from)  the  rug  is  written  by  (or  to) 
memory.     This  allows  one  to  write  the  rug  with  one  latch  while  loading 
up  the  other  with  the  next  data  to  stuff  into  the  rug.     This  speed  is 
only  crucial  on  some  router  operations. 


RUG  REGISTERS 


These  are  read/writable  unless  otherwise  noted, 
all  signals  are  active  high  unless  otherwise  noted 
setup  is  all  active  unless  otherwise  noted, 
number  of  bits  per  name  is  1  by  default. 

(defvar  spec) 

(defconst  cm : *ch i p-rug-bi ts* 


'( 


(0 


to  meta  poi  nt  **temp 
interpreted  by  (cme:parse- 


rug-regi  sters) 


Router  rug  Locations  to  be  stored 
All  setup  is  Don't-care 
: empty-p 
( :  empty-p 


on  trace,   reloaded  on  read 


(1 


(2 


( : e j  ect-p 


:  request 
( : request 


:Match-box 
( : Match-box 


; ; i  n  snarf 

: s  i  ze  cm: *ch  i  p-number-of-buf  f e  rs-l imi  t * 
:setup  :enabled 
:type  :spy) 

: s  i  ze  cm: *ch  i  p-numbe  r-of-comb  i  ned-buf  f e  rs-l im  i  t * 
:setup  :enabled 
:type  :status)) 


:type  :  spy  ;; loaded  from  rug  or 
;; route  ops.  (see  injector  for 
: setup  : DONT-CARE 

:size  cm: *ch i p-number-of-processors-l imit*)) 


router  computation  on  some 
further  detai  I s) 


loaded  from  rug  or  router  computation  on  some 


type  :spy 
; ; route  ops 

:size  cm: *chi p-number-of-match-boxes*))  ;15 

0(cme: heart-rug-state  3) 

this  is  a  list  of  rug  locations  for  each  dimension  (therefore  12  locations). 
Each  rug  register  has  14  bits  (2  for  each  buffer  assuming  7  buffers) 
the  LSB  is  the  vavle-l  bit  for  the  top  buffer.  The  merge-l  bit   is  next 
The  setup  for  each  is  Don't  care  because  the  router  will  figure  it  out  the 
first  time  it  is  used.     These  can  be  loaded  from  the  rug,  or  they  can  be 
determined  by  the  router  on  route  ops.  Assuming  7  buffers  and  12  dimensions, 


:DIMENSION-0  ( : DIMENSION-0  :SIZE  14  :SETUP  :enabled  : TYPE  :SPY) 
:DIMENSI0N-1  ( :DIMENSION-1  :SI2E  14  :SETUP  :enabled  : TYPE  :SPY) 
:DIMENSI0N-2  ( .-DIMENSION-2  :SIZE  14  :SETUP  :enabled  : TYPE  :SPY) 
:DIMENS10N-3  ( :DIMENSION-3  :SIZE  14  :SETUP  -.enabled  : TYPE  :SPY) 
.-DIMENSION-*  ( :DIMENSION-4  :SIZE  14  :SETUP  :enabled  : TYPE  :SPY) 
:DIMENSI0N-5  ( :DIMENSION-5  :SIZE  14  :SETUP  :enabled  : TYPE  :SPY) 
:DIMENSI0N-6  ( :DIMENSION-6  :SIZE  14  :SETUP  -.enabled  : TYPE  :SPY) 
:DIMENSI0N-7  (  :DIMENSI0N-7  :SIZE  14  :SETUP  :enabled  :TYPE  :SPY) 
:DIMENSI0N-8  ( :DIMENSION-8  :SIZE  14  :SETUP  :enabled  :TYPE  :SPY) 
:DIMENSI0N-9  ( :DIMENSION-9  :SIZE  14  :SETUP  :enabled  : TYPE  :SPY) 
:DIMENSION-10  ( : DIMENSION-1 0  :SIZE  14  :SETUP  :enabled  : TYPE 
:DIMENSI0N-11  (: DIMENSION-1 1  :SIZE  14  :SETUP 
0(cme : ce I l-address-rug— state  15) 
These  can  be  loaded  from  the  rug  or  the  router 
:Ce I l-address  24  Bits,  bits  <3:0>  are  from 

buffer  0,  etc. 

(15  : CELL-ADDRESS-0  ( : CELL-ADDRESS-0  :SIZE  16  : SETUP  : DONT-CARE 
(16  : CELL-ADDRESS- 1   ( : CELL-ADDRESS- 1    :SIZE  08  :SETUP  : DONT-CARE 


3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 


SPY)) 

enabled  :TYPE  :SPY)) 


:TYPE 
:  TYPE 


:SPY) 
:SPY) 
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(17  :  f I  i  p0 

(  :  f I i  p0 


(18  :flip1 
(:fliP1 

(19  :error 

( :  s  i  gna l-a I l-er  rors 

( : Er ror-Enob I e 

( :Global-Enoble 

( : Led-Enabl e 

( : Syndrome 

( : Er  ror-Code 
) 

(20  :message-p 

( : f orward-message-p 


;  ; I  east  s  i  gn  i  f  i  cant 

: s  i  ze  16. 

: type  :status)) 


bits,  stages  1  and  2 


( :  backward-message-p 


;  ;most  significant  bits,  stages  4  and  8 
: s  i  ze  16. 
:type  istatus)) 

;setup  :disabled 

:type  :status)       loaded  only  from  rug 
:setup  :enabled  ;enable  assertion  on  pin 
:type  :status)       ;; loaded  only  from  rug 
:setup  :enabled  ;enable  assertion  on  pin 
: type  istatus)   ;;  loaded  only  from  rug 
:setup  :enabled  ;enable  assertion  on  pin 
:type  :status)  ;;loaded  only  from  rug 

;;the  ecc  syndrome  generated  on  the  most 
:size  6  ;; recent  memory  error 

: type  :error)   ;;  loaded  on  error  or  by  rug  op 
:size  (length  cm : *ch i p-rug—e r ro r— codes* )  ;6 
:type  :error)   ;; loaded  on  error  or  rug  op 


;;  the  message  bit  for  each  buffer  going  forward 
:size  cm : *ch i p-numbe r-of-buf f e rs- I i m i t *  ;;7 
:setup  :disabled 
:type  :spy) 

;;  the  message  bit  for  each  buffer  going  backward 
:size  cm: *chi p-number-of-buf f ers-l imi t*  ; ;7 
:setup  :disabled 
:type  :spy) 
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: status 


this  must  be  all   read-only  except  for  parity-p 


that  rug  testing 
-George  10/25/85 
from  rug 
;;enables  parity  mode 


; ;NOTE:  parity-p  is  here  so 
;;     more  straight-forward. 
(:parity-p  :type  :status  ;;loaded  only 

:setup  (if  cm : *par i ty-mode* 

: enab I ed 

: d  i  sab  I ed) ) 

(:pin-test-p  :type  :status  ;; loaded  only  from  rug 

:setup  :disabled) 
( :  i nput-l atch  :type  :spy} 

( :g lobal— I atch  :type  :spy)     ;;  Low  active  to  match  I/O  pin 

; ;Global-latch    NOR  of  (and  condition-bit  carry-output)  for  each 
;;processor  (latched  on  Store) 

(:cs  :type  :read-only)     ;;chip  select  pin  on  the  chip 

; ; h  i  gh  act  i  ve(?) 


(  :  ser  i  a l-number 


:size  4  :type  '.read-only) 
;;Betas  serial  number  is  1 
; ; (bi  t  0  a  1 ,   rest  0) 


(22  : cube-cont ro I 

( :which-dims- receive 
( :which-dims-send 
( : cube-se l-xor 


; ;Sets  up  the  news  direction  latches  for 

;;off  chip  and  on  chip  connections 

;;I  dont  know  the  number 

:size  5     ;;which  direction  of  the  cube 

; ; i  s  to  be  watched 
:type  :status  ;;  loaded  only  from  rug 
:setup  :enabled) 

:size  5     ;;which  direction  of  the  cube 

;  ;  i  s  to  be  sent  out 
:type  :status      loaded  only  from  rug 
:setup  :enabled) 

:size  4     ;;to  be  xor'd  into  the  CUBE  field  of  STORE 
:type  :status  ;;  loaded  only  from  rug 
:setup  :disabled)) 
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(23 


: router-mode 
(:Fop 

( : Backward 
( : Inject-P 

( : desperat  i on-route-p 


(24 


:Bad-Wi  re-Bi  ts 
( :Bad-Wi  re-Bi  ts 


:size  2  ;;bits  of  operation  to  the  combiner 

:setup  idisabled  ;;this  is  pass  A  function 

: type  :status)  ;; loaded  only  from  rug 

:size  1   ;;  bit  for  going  forward  (0)  and  backward(1) 

.•setup  :disabled 

:type  :status) 

:size  cm:*chip-number-of-injectors-l imit*  ;6 
:Setup  : a  I l-enabl ed 
:type  :status) 

: si  ze  cm: *chi  p-numbe r-of-buf fers-l imit* 

;;Bit  0  is  for  the  top  row  of  the 

;;heart.     These  bits  specify  whether  a  row 

;;of  the  router  will  desperation  route. 

:type  :status  ;; loaded  only  from  rug 

:setup  ; ; a  I  I  disabled  except  the  last  buffer 

(append 

(loop  for  buffer  below 

(1-  cm: *ch  i  p-numbe r-of-buf fers-l imit*) 
col lect  ing  :disabled) 

' ( : enab I ed) ) ) )  ;;loaded  only  from  rug 


:type  :status  ;; loaded  only  from  rug 

: s  i  ze  cm: *ch  i  p-numbe r-of-d  i mens  ions* 

:setup  ;; lower  dimensions  where  there  is  a  machine 

;;dependent  on  size  of  the  machine. 

(append  (loop  for  dimension  below 

(-  cm : *ch  i p-numbe  r-of-d  i mens  i  ons* 
(haulong  (1- 

cm: *chi  p-numbe r-of-proces so rs- 1 imit*))) 
col lect  ing  :disabled) 
(loop  for  dimension  from 
(-  cm: *ch  i  p-numbe r-of-d  i mens  i  ons* 
(haulong  (1- 
cm : *ch  i  p-numbe r-of-processors- 1 imit*))) 
be  I ow  cm: *ch  i p-numbe  r-of-d  i mens  i  ons* 
co I  I  ect  i  ng  : enab led) ) ) ) 


(25  :pari ty-errors 

( : f  orwa  rd-par  i  ty 

( : backward-par  i  ty 

(26  : cube-out 

( : cube-out 

(27  :Cube-In 
( :Cube-In 


(28  :heart 

( : heart-out 


( : de I i  ver-p 


(29  :  spread 

( : spread 


(30  :illegal-30 
(:ill ega I -30 


;;parity  errors  in  the  router  buffers 
:size  cm: *chi p-number-of-buf fers-l imi t*  ;;7 
:type  :spy) 

:size  cm: *chi p-number-of-buf fers-l imi t*  ; ;7 
:type  :spy)) 

: s  i  ze  cm : *ch  i p-numbe r-o f-d  i mens  ions* 
: type  : read-only)) 

;;See  chip  latches  section 

:size  cm: *ch i p-number-of-processors-l imi t*  ;16 
itype  : read-only)) 


;;in  router.     This  need  not  be  setup 

;;NOTE:  this  should  not  be  put   in  with  another  field 

;;  which  must  be  preserved  while  writing  this  field. 

;;  (because  LOADA  resets  heart-out.) 

: s  i  ze  cm: *ch  i  p-number-of-buf  fe  rs- 1 imit* 

:type  :spy) 

;;in  router.     This  should  be  disabled  on  setup 
:size  cm :*chi p-numbe r-of-comb i ned-buf fers-l imit* 
:setup  idisabled 
:type  :spy)) 

;; inverse  of  global-latch,  repeated 
;;16  times  (once  for  each  processor) 
:size  16  :type  :reod-only)) 

;;  writing  to  this  will  generate  and  error 
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;  ;   reads  zeros 
: s  i  ze  0 

:type  : i I  legal-address)) 
(31  :Sink  ;;this  can  be  written  with  no  effect 

; ;NOTE:  This  must  be  at  address  31  (error-system). 
(:Sink  ;;Place  to  dump  unwanted  bits  and  read  0's 

: s  i  ze  0 

:type  :address— only)) 


Spread  i  ng 

The  SPREAD  register  of  the  rug  allows  a  value  of  the  inverse  of  the  GLOBAL-LATCH 
latch  (the  global  signal  on  the  most  recent  store  cycle)  to  be  written  to  all  of 
the  processors.  This  is  useful  for  copying  information  among  the  processors  of 
the  chip.     The  value  read  by  the  spread  register  is  not  effected  by 
GLOBAL-ENABLE. 
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The  Fl i  pper 


The  Flipper  is  a  device  for  permuting  the  16  SUM  outputs  of  the 
16  processors  on  the  chip,  before  they  are  written  into  memory.        It  is 
able  to  perform  various  useful  permutations  (as  defined  in  Table  III) 
including  shifting  and  reversing.     The  flipper  permutes  the  alu-sum 
outputs  of  the  alu  according  to  the  permutation  specified  by  Flip0  and  Flipl 
registers  of  the  RUG.     This  permutes  the  data  written  to  memory  on  the 
store  cycles. 

Cond i t i ona I i zat i on  interacts  with  the  flipper  as  follows.  Each 
processor  calculates  its  unpermuted  alu  sum,  without  regard  to  the 
condition.     These  values  are  then  permuted  by  the  flipper.  Each 
processor  will  then  write  either  the  permuted  value  or  the  contents  of 
the  c-latch,  depending  on  the  state  of  the  condition. 

The  action  of  the  flipper  is  determined  by  the  contents  of  the 
Flip0  and  Flipl   registers  of  the  RUG.  Each  bit  controls  one  "exchanger." 
Whenever  the  bit   is  one,   the  exchange  will  exachange  its  two  data  bits. 
Whenever  the  bit   is  zero,   the  exchange  will  pass  its  two  data  bits  without 
permuting  them.  The  exchangers  are  arranged  in  four  stages,  with  the 
output  of  each  stage  feeding  into  the  input  to  the  next,  so  that  the 
total  action  of  the  flipper  is  the  composition  of  the  actions  of  the 
four  stages.     The  correspondence  between  control  bits,  exchangers,  and 
stages  in  shown  in  Table  I.     The  Table  shows  the  effect  of  each  bit  of 
the  register  on  each  to  the  four  stages  of  the  flipper. 

Table  I 

(The  entry  (a  b)  in  with  control  bit  c  and  stage  s  indicates  that  Bit  c 
of  the  Flip  register  controls  an  exchanger  across  data  bit  a  and  b  in 
stage  s.) 


(defconst  cm: *f I i p-bi ts*  '( 
;Cont  ro I 

;Bit        Stage  Exchanger 


(0 
1 
2 
3 
4 
5 
6 

[7 

[8 

(9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 

(28 

[29 


(8  10) 
(9  11 
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(30  3 
(31  3 
)) 

§\ 

Table  III 


15  !!i 


Useful 
I* 

(defvar  cm:*fl 
;these  are  the 
;  name 

IDENTITY 
BACKWARD 
ROT ATE- 14 
ROT ATE- 13 
ROT ATE- 12 
ROTATE- 1 1 
ROT ATE- 10 
ROTATE-9 
ROTATE-8 
ROTATE-7 
ROTATE-6 
ROTATE-5 
ROTATE-4 
ROTATE-3 
ROTATE-2 
FORWARD 
WEST 
EAST 
NORTH 
SOUTH 
W 
X 
Y 
Z 


Values  for  the  flipper 


i  p-usef u 
va I ues 
; f I i  p0 
0 

43775 
65280 
22015 
0 

43775 
65280 
22015 
0 

43775 
65280 
22015 
0 

43775 

65280 

22015 

26367 

39423 

65535 

65535 

255 

65535 

65535 

65535 

#xFFFF 


l-values*  ' 
used  by  the 
;  f  I  i  p  1 
0 

32904 
49356 
57582 
61695 
63607 
64563 
65041 
65280 
32648 
16332 
8174 
4095 
1911 
819 
273 
0 
0 

61695 
4095 
0 
0 

255 
65535 
#xFFFF 


( 

diagnostics  to  test  the  chip 
pe rmutat  i on 

7 
8 
9 


1 
2 
3 
4 
5 
6 
7 
8 

9  10  11   12  13  14  15  0 

10  11   12  13  14  15  0  1 


3 
4 

5 
6 
7 
8 

9  10  11   12  13  14  15  0 

10  11   12  13  14  15  0  1 

1  2 

2  3 


8  9  10  11   12  13  14 

9  10  11   12  13  14  15 

10  11  12  13  14  15  0 
10  11   12  13  14  15  0 

11   12  13  14  15  0  1 


10  11   12  13  14  15  0  1 


10  11   12  13  14  15  0  1  2  3  4 

11  12  13  14  15  0  1  2  3  4  5  6  7  8  9 

12  13  14  15  0  1  2  3  4  5  6  7  8  9  10 

13  14  15  0  1  2  3  4  5  6  7  8  9  10  11 

14  15  0  1  2  3  4  5  6  7  8  9  10  11  12 

15  0  1  2  3  4  5  6  7  8  9  10  11  12  13 
1  2  3  0  7  4  5  6  9  10  11  8  15  12  13 

2  5  6  7  4  11  8  9  10  13  14  15 
4  11   10  9  8  15  14  13  12  3  2  1 
2107654  11   10  9 


3  0  1 
7  6  5 
15  14  13 
1  0  3 
3  2  1 
7  6  5 
15  14  13 
15  14  13 


12 
2  5  4 
0  7  6 
4  3  2 


6  9  8  11   10  13  12  15 
4  11  10  9  8  15  14  13 
0  15  14  13  12  11   10  9 
12  11   10  9  8  7  6  5  4  3  2  1 
12  11   10  9  8  7  6  5  4  3  2  1 


15) 
0) 
1) 
2) 
3) 
4) 
5) 

?J 
Si 

10) 
11) 
12) 
13) 
14) 
14) 
12) 

0) 

8) 
14) 
12) 

:il 

0)) 


fl: )cfn)defs)f28B)ch ip-spec if i cat  ion, 1  isp. 139     1/19/87  19:55:52     page  2B 


PARITY  and  ECC 

There  are  four  error  detection  mechanisms  in  the  chip.  Each  instruction 
has  parity  protection  on  executed  cycles  (not  unselected  read  or  nop). 
Memory  parity  or  ECC  is  always  checked  when  memory  is  read  into  the 
chip.  The  router  can  signal  an  error  if  message  parity  is  not  preserved 
either  going  forward  or  backward.     If  an  error  is  signaled,  and  the 
error  pin  and/or  an  error  recovery  system  is  activated. 

Instruction  Parity  Errors: 

On  all  cycles  except  READ  and  NOP,  the  parity  of  the  instruction  is 
checked.     If  an  error  occurs  an  error  is  signaled.     Even  parity  is  used. 

Memory  Read  Errors: 

At  any  time  the  chip  is  either  in  Parity  Mode  or  in  ECC  mode,  depending 
on  the  Parity-p  bit  in  the  RUG  and  whether  the  route  cycle  is  being  executed. 
When  in  Parity  mode,  Even  parity  is  used. 

The  chip  operates  in  memory  parity  mode  when  the  parity-p  rug  bit  is 
asserted  or  when  the  chip  is  executing  a  route  instruction.     In  parity 
mode  the  ECC  Pin  0  writes  or  expects  to  read  an  even  parity  bit  for  the 
Memory  Pins.     ECC  Pins  1   through  5  are  driven  with  the  same  data  as 
Memory  Pins  1  through  5,  respectively. 

The  chip  operates  in  ECC  mode  when  the  parity-p  rug  bit  is  low. 
The  ECC  (Error  Correcting  Code)  system  on  chip  corrects  all  single  bit 
memory  errors  and  detects  all  double  bit  memory  errors.  Each  ECC  bit  is 
computed  by  XORing  or  XNORing  eight  of  the  Memory  Pins  according  to  the 
table  cm: *ecc— code* . 

While  changing  between  parity  to  ecc ,  the  chip  should  be  referencing  a 
memory  value  that  is  valid  in  both  modes,  for  example  16  (decimal). 

Note  that  memory  checking  and  writing  on  a  ROUTE  cycle  (the  address  is 
the  address  of  a  bit  slice  in  the  router  buffers)  is  a  very  special 
case.     For  ROUTE,    it  always  reads  and  writes  in  Parity  mode,  regardless 
of  the  setting  of   :Parity-p.    Also,  memory  on  ROUTE  cycles  is  written 
and  read  in  halves;  while  one  half  is  written,  the  other  half  is  read. 
There  are  seven  buffers,  so  the  eighth  bit  of  the  half  is  used  to  insure 
that  each  half   is  always  an  even  number  of  bits;  hence  the  parity  bit  is 
always  zero.     For  this  mechanism  to  work  properly,   router  buffers  must 
be  zeroed  in  Parity  mode  before  any  routing  is  done. 

Router  Parity: 

Forward  router  parity  works  as  follows:     On  the  input  of  the  heart  parity  is 
generated  and  can  be  injected  into  the  message.     This  is  checked  on  the 
output  of  the  heart.     Parity  bits  can  be  placed  arbitrarily  often  in  the 
message  and  probably  in  the  address  as  well.     Both  the  address  and  the 
data  are  protected  with  this  scheme.     Since  the  address  changes  as  it 
progresses  from  chip  to  chip,  the  parity  is  also  updated.  This 
protection  gives  multiple  parity  protection  of  a  message  and  the 
address.     This  mechanism  can  also  diagnose  what  wire  went  bad  by 
degrading  the  performance  of  the  petit  cycle  by  5%.     This  is 
accomplished  by  saving  the  state  of  the  heart  after  each  petit  cycle  so 
that  if  an  error  occurred  one  can  examine  the  decisions  made  by  the 
heart  and  find  what  wire  went  wrong.     This  would  track  all  error  that 
occurred  over  wires  during  address  transmission.       If  one  saved  the 
buffer  state  also,  then  the  petit  cycle  could  be  rerun  continuously  to 
try  to  make  the  error  recur. 

Backward  router  parity  is  similar,  except  the  parity  is  placed  on  the 
message  just  after  the  heart,  and  checked  right  before  the  heart.  (this 
view  of  the  heart  is  from  the  forward  view). 

Signal  I ing  an  error: 

When  an  error  occurs  the  error-code  field  in  the  rug  is  loaded  with 
the  type  of  error  on  the  following  cycle  (unless  the  error  register 
happens  to  be  reference  on  that  cycle,   in  which  case  the  error— code  is 
not  recorded).     All  errors,  except  corectable  memory-errors  when 
the  s i gna l-a I  I— e r rors  latch  is  low,  will  also  signal  an  error 
on  the  error  pin  on  the  cycle  following  the  error,   if  the  enable 
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errro  bit  is  high. 
I# 

(def const 
cm:*ECC-CODE* 
'( 


;ECC  PIN     0  1 


Memory  Pin 

2    3    4    5    6    7    8    9    10  11  12  13  14  15 


(XNOR 
(XNOR 
(XNOR 
CXNOR 
XOR 
XOR 


1     110000001  101101' 


0  0 
0  0 


0  111 
0    0    0  0 


0  0 

1  1 


1 


0     0     0  0 


10  110) 
110  11 
0    0    0    0  0) 
1110  0) 


1     0     0     0     0     0     0     1  1 
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ECC0 
ECC1 
ECC2 
ECC3 
ECC4 
ECC5 


; (A  "1"  indicates  that  the  memory  pin  is  used  in  computing  the 
;bits  0,1,2  are  computed  with  XNOR,  bits  3,4,5  are  computed  with  XOR 
; cor  respond i ng  ECC  pin.) 

#l 


Whenever  any  memory  error  (correctable  or  uncorrectable)   is  detected  the 

Syndrome  register  of  the  RUG  is  loaded  with  the  error  syndrome. 

If  a  single  bit  error  is  detected,   it  is  corrected  without  signalling  an 

error  (unless  the  Signal-All-Errors  rug  bit  has  a  1),  and  the 

Corrected-Memory-Error  error  code  is  set  in  the  Error-Code  rug  field. 

If  a  multiple  bit  error  is  detected,  the  Uncor rectab I e-Memory-Er ror 

error  code  is  set  in  the  Error-Code  rug  field.     Any  multiple 

bit  error  that  generates  a  syndrome  with  even  parity  will  be  detected. 

This  includes  all  two  bit  errors. 
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SendR,  LatchR,  Direct  Cube  Write,  and  NEWS 


The  transmission  of  data  between  chips  is  controlled  by  the  SendR  and 
LatchR  pins.    A  chip  with  its  SendR  pin  active  may  drive  data  onto  one 
or  more  of  its  cube  pins.     A  chip  with  its  LatchR  pin  active  may  latch 
data  from  one  or  more  of  its  cube  pins  into  the  corresponding  Cube-In 
latches.     (Note  that  a  RUG  instruction  can  also  load  the  Cube— In 
I  atches . ) 

There  are  three  distinct  mechanisms  from  transmitting  and  receiving 
data:     direct  cube  write,  NEWS,  and  the  router.     These  three  mechanisms 
are  related  and  share  some  parts.     Only  direct  cube  writing  and  NEWS  are 
discussed  in  this  section. 

Direct  cube  writing  allows  processors  0-11  to  read  and  write  cube  pins 
0-11  directly.     In  this  mode  the  data  driven  to  cube  pins  is  taken  from 
the  R  latches  for  processors  0-11,  and  data  from  cube  pins  is  read  into 
the  Cube-In  latches  of  processors  0-11.     (The  LOADA  instruction  with 
BSEL=Router  can  then  transfer  data  from  the  Cube-In  latches  to  the  B 
latches.)    Processors  12-15  do  not  provide  data  for  cube  pins,  and 
Cube-In  latches  12-15  are  always  cleared  in  this  mode. 

The  NEWS  mechanism  may  be  used  for  regular  grid-like  patterns  of 
communication  between  processors.     The  pattern  is  controlled  by  a 
combination  of  the  the  News  and  Flip  registers  in  the  RUG  and  the  4— bit 
Cube-Sel    latch  (which  is  loaded  by  the  STORE  instruction).     The  News 
section  of  the  RUG  contains  the  Which-Dims-Send  register,  the 
Wh i ch-D ims-Rece i ve  register,  and  the  Cube-Sel-Xor  register.     The  Flip 
section  of  the  RUG  contains  32  bits  for  controlling  the  flipper.  These 
registers  must  be  reloaded  each  time  a  different  NEWS  direction  or 
transmission  mechanism  is  to  be  used. 


When  the  SendR  pin  is  active,  the  cube  pins  that  are  enabled  for  driving 
are  selected  by  the  Which-Dims-Send  field  of  the  Cube-Control  register. 
The  action  of  the  Which-Dims-Send  field  is  shown  in  Table  5.     Values  0-11 
are  used  for  NEWS  communication.     Value  19  is  used  for  routing,  and 
value  18  is  used  for  direct  cube  writes. 


I# 


(def const  cm: *wh  i  ch-d  ims-send-encod  i  ng* 
Table  5.  Which-Dims-Send 

Which-Dims-Send  I  Source  of  Data  I  Cube  Wires  Driven 


'( 


0 
1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
(25 


R 

^Cube- 

-Sel ' 

-to- 

-Cube- 

-Pin 

0' 

R 

|Cube- 

-Sel ' 

-to- 

-Cube- 

-Pin 

'1 ' 

R 

|Cube- 

-Sel ' 

-to- 

-Cube- 

-Pin 

'2' 

R 

|Cube- 

-Sel' 

-to- 

-Cube- 

-Pin 

3' 

R 

|Cube- 

-Sel' 

-to- 

-Cube- 

-Pin 

'4' 

R 

|Cube- 

-Sel ' 

-to- 

-Cube- 

-Pin 

'5' 

R 

|Cube- 

-Sel' 

-to- 

-Cube- 

-Pin 

6' 

R 

[Cube- 

-Sel' 

-to- 

-Cube- 

-Pin 

'7' 

R 

'Cube- 

-Sel ' 

-to- 

-Cube- 

-Pin 

'8' 

R 

^Cube- 

-Sel' 

-to- 

-Cube- 

-Pin 

y 

R 

'Cube- 

-Sel' 

-to- 

-Cube- 

-Pin 

'10 

R 

'Cube- 

-Sel' 

-to- 

-Cube- 

-Pin 

'n 

reserved 
rese  rved 
reserved 
reserved 
reserved 
reserved 
R[0-1 1 ]-to-Cube-Pi  n[0-1 1 
Router-to-Cube-Pi  n[0-1 1 ] 
reserved' 
rese  rved 
reserved 
rese  rved 
rese  rved 
rese  rved 
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26 

: reserved 

27 

:  reserved 

28 

: reserved 

29 

: reserved 

30 

:  reserved 

31 

: none) 

)) 
#l 


The  data  coming  in  the  cube  pins  may  be 
active.  LATCHR  is  unclocked.  In  other 
latched.  Which  cube  pins  are  read,  and 
is  determined  by  the  Which-Dims-Receive 
shown  in  Table  7.  Values  0-11  are  used 
for  reading  input  from  ECC5  pin 
reading  input  from  the  I/O  pin. 


loaded  into  the  Cube-In  latch  when  LATCHR  is 
words,  whenever  latchr  is  active  a  latch  is 
which  bits  of  the  Cube-In  latch  are  loaded, 
field  of  the  Cube— Control   rug  register,  as 
for  NEWS  communication.    Value  28  is  used 
(for  alpha  compat i bi I ty) .     Value  29  is  used  for 
Value  30  is  used  for  NEWS  communication  in 


processors  that  are  at  the  edge  of  the  grid  and  should  receive  a  boundary  value. 
Value  18  is  used  for  direct  cube  writes  and  for  routing. 


(defconst  cm:*whi  ch-d  i  ms-rec  i  eve-encod  i  ng* 

;;  Table  7.    Which-Dims-Receive  (5  bits) 

;;  Which-Dims-Receive  I  Source  Read  I  Latches 


I oaded 


)) 
#l 


'((0 

: Cube-Pi  n 

'0' 

-to-Cube-In 

'Cube-Se  1  \ 

:Cube-Pi  n 

'1' 

-to-Cube-In 

Cube-Se 1 ' 

[I 

: Cube-Pi  n 

'2' 

-to-Cube-In 

Cube-Se 1  [ 

(3 

: Cube-Pi  n 

'3' 

-to-Cube-In 

Cube-Se 1  [ 

(4 

: Cube-Pi  n 

V 

-to-Cube-In 

^Cube-Se  i  [ 

(5 

: Cube-Pi  n 

5' 

-to-Cube-In 

'Cube-Se  1  [ 

(6 

: Cube-Pi  n 

'6' 

-to-Cube-I n 

'Cube-Sel ' 

(7 

: Cube-Pi  n 

'7' 

-to-Cube-I n 

Cube-Se 1 [ 

(8 

: Cube-Pi  n 

'8' 

-to-Cube-I n 

Cube-Se 1 ' 

(9 

:Cube-Pi  n 

V 

-to-Cube-I n 

Cube-Se 1 ' 

(10 

:Cube-Pi  n 

10]-to-Cube-In[Cube-Se I 

(11 

:Cube-Pi  n 

]-to-Cube-In[Cube-Se 1 

(12 

: reserved 

(13 

: reserved 

(14 

: reserved 

(15 

: reserved 

(16 

: reserved 

(17 

: reserved 

(18 

:Cube-Pi  n 

0-11]-to-Cube- 

-In[0-15]) 

(19 

: reserved 

(20 

: reserved 

(21 

: reserved 

(22 

: reserved 

(23 

: reserved 

(24 

: reserved 

(25 

: reserved 

(26 

: reserved 

(27 

: reserved 

(28 

:ECC5-Latch- 

-to-Cube-In[Cube-Se 1 ] ) 

(29 

:  IO-Latch- 

-to-Cube-In[Cube-Se 1 1) 

(30 

: Edge-Latch- 

■To-Cube-In[Cube-Se 1 ]) 

(31 

: none) 

Cube-In[12-15]  are  cleared 


The  standard  NEWS  cycle  is  then  LOADA[BSEL=1 ]  RUG-NEWS 
{STORE}*,  where  each  pair  of  STORE  cycles  has  a  different  value 
for  the  CUBE  field,  and  of  each  pair  one  asserts  Sendr-even/Latchr-odd 
and  the  other  asserts  Sendr-odd/Latchr-even  (the  second  STORE  may  be  a 
NOP  instead,  since  the  important  data  is  latched).     The  transfer  of  bits 
is  pipelined  by  one  LLS*  cycle.     The  LOADA  copies  the  Cube-In  data  just 
received  to  the  B  latch,  as  well  as  loading  the  A  latch  in  the  usual 
manner.     The  RUG-NEWS  reads  from  memory  data  to  be  sent  (on-chip  and 
off-chip)  and  loads  it  into  the  R  latch  and  Cube-In  latch.     Then  the 
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STORE  instructions  store  the  data  just  received  (now  in  the  B  latch), 
combined  with  the  A  data  if  desired,   into  memory  (many  times, 
rsdundant ly) ,  and  also  transfer  data  across  the  cube  wires,  sourced  fro 
the  R  latches  and  stored  into  the  Cube-In  latches  for  the  next  cycle. 
To  transfer  N  bits  therefore  requires  N  LoadA[Bse 1=1 ]  Rug-News  {Store}* 
cycles  followed  by  a  LoadA[Bse 1=1 ]  Rug-News  {Store}  cycle  to  store  the 
last  bit. 
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Rules  for  Clocking: 
There  is  only  one  clock  pin  on  the  chip. 

When  the  clock  is  0,  the  other  input  pins  may  change  arbitrarily 
without  affecting  the  state  of  the  chip,  except  Latchr.     If  latchr  is 
asserted  then  the  latch  is  opened  to  secure  the  data  coming  in  over  the 
cube  lines.     Before  the  clock  becomes  active,  all  pins  must  be  stable 
for  some  setup  time. 

When  the  clock  is  1,  all   instruction  pins  must  remain  stable. 
Memory  pins  may  change  but  must  remain  stable  for  time  not   less  than  a 
setup  time  before  the  clock  becomes  Low.     Output  pins  may  become  driven 
as  early  T-CLH-on  after  the  inactive  to  active  transition,  but  are  not 
guaranteed  to  reach  these  correct  values  until  T-CLH-ready  after  the 
transition.  In  the  case  of  output  pins  that  depend  directly  on  input 
pins  in  the  current  cycle,  the  output  will   reach  their  stable  state 
T-CLH-ready  after  the  transition  or  T-propagate  after  the  corresponding 
input  pins  reach  their  proper  states,  which  is  even  later.     After  the 
clock  goes  low,  all  pins  must  remain  stable  for  at  least  T-CHL-hold. 

The  internal  control  circuitry  is  designed  so  that  if  the  chips  are 
wired  properly  no  software  initialization  error  can  cause  two  internal 
drivers  to  simultaneously  drive  the  same  line. 


Initialization: 

After  power-up,  the  following  steps  must  be  taken  in  sequence  to  assure  the 
proper  initialization  of  the  processors  and  router  <> 

-Initialize  the  rug  (this  also  initializes  the  error  condition  in  the  rug) 
—Write  zero  to  each  of  processors  flags 
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ELECTRICAL  SPECIFICATIONS 
Inputs 
Outputs 

Powe r  Supp I i  es 
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Pin  Summary 


Name 

Numb©  r 

t  /n 
l/u 

Assertion  level  Comment 

Control  Pins 

Clock 

1 

Input 

High 

Strobe  to  execute  instuction 

t  CTi  PC 

4 

1 

1  n/uu  t 

Low 

Chip  select,  active  low  and  LED 

Error 

1 

Output 

Low 

Error  during  current  cycle,  open 

dra  i  n 

I/O 

1 

In/Out 

High 

Global  output  and  input,  open  drain 

Processor  Inst  rue t  i  on 

Pi  ns 

ADO  O 

•i 
o 

input 

High 

What  type  of  instruction 

10-16 

16+1 

In/Out 

High 

Instruction  and  Data  bus  +  parity 

Memory  Pins 

M0-15 

16 

In/Out 

High 

Memory  bus  Data 

ECC0 

1 

In/Out 

High 

Error  Check  Bit  0  or  Memory  bus  Data 

tUU  1  — O 

c 
O 

T  n  /r\,  ■  + 

1  n/uu  t 

H  i  gh 

f—                     (-11            l ,     n;j._      4  c 

Error  Check  Bits  1—5 

Communications  Pins 

Cube0-1 1 

12 

. In/Out 

Low 

Cube  connection  0-11 

LatchR 

1 

I  nput 

High 

Latch  router  inputs 

SendR 

1 

Input 

Low 

Send  router  outputs 

Power  Pins 

VDD 

4 

powe  r 

High 

+5  power 

VSS 

4 

powe  r 

Low 

ground 

Total 

68 
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PINS 


I/O  Pin 


I/O 

Comments 

LoadA 

I 

Input  for  i/o  latch 

LoadB 

I 

Input  for  flag  read 

LoadI 

Z 

Not  specified 

( 1 ow  act  i  ve) 

Store 

0 

Output  of  global  signal 

unless  global  enable  is 

i  nact  i  ve 

Read 

z 

Not  specified 

Route 

0 

Output  of  buffer  empty 

( low  act  i  ve) 

unless  global  enable  is 

i  nact  i  ve 

NOP 

z 

Not  specified 

RUG 

z 

Not  spec  1 f  i  ed 

The  Global  signal   (the  output  part  of  the  I/O  pin)  is  used  for 
communication  with  the  host  computer  and  for  fast  I/O.     The  logical  NOR 
of  each  processor's  Carry  is  available  on  the  I/O  pin  on  the  Store 
instruction.    On  the  route  instruction  the  Buffer  Empty  signal    is  put  on 
the  1/0  pin.     That  is,   if  a  chip  has  a  router  buffer  that  is  NOT  empty 
then  it  asserts  the  global. 


Led/CS  Pin 


Led/CS 

LoadA 

X 

LoadB 

X 

LoadI 

CS 

Store 

Led 

Read 

CS 

Route 

Led 

NOP 

X 

RUG 

CS 

ike  the  I/O  pin. 


Comment  s 

Not  spec  i  f  i  ed 

Not  spec  i  f  i  ed 

Input  for  setting  COND  Latch 
Output  of  global  signal  (low  active) 
unless  LED  enable  is  inactive 
Input  for  memory  parity  checking 
Output  of  buffer  empty    (low  active) 
unless  LED  enable  is  inactive 
Not  specified 
Input 


Led  enable 

1 

1 

0 

0 

ERROR  PIN 


Global 

0 

1 

0 
1 


LED 

1  (on  the  pin  is  5V,   light  is  ON) 

0 

0 

0 


A  error  is  signalled  if: 
(and    <There  is  an  error> 
Enab I e-Er  ror) 

An  error  is  signalled  only  once  so  that  additional  errors  can  be 
detected . 

Comments 

or  Memory— error  Inst ruct ion-error} 
or  Memory— error  Instruction-error) 
Inst  ruct  i  on-e  r ror 

(or  Memory-error  Instruct  ion— error) 
Memory-error 

(or  Memory-error  Instruction-error 

(and  enab I e-message-e rror  message-error)) 
Never 

(or  Memory-error  Instruction-error) 

The  Error  pin  is  always  driven  directly  from  the  ERROR  latch,  unless 
Enable-Error  is  not  asserted. 

Memory  Pins  and  Instruction  Pins 

M  I/O      I  I/O  Comments 


Enab I ed 

LoadA 

Yes 

LoadB 

Yes 

LoadI 

Yes 

Store 

Yes 

Read 

(if  cs  yes) 

Route 

yes 

NOP 

no 

RUG 

yes 
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LoodA 

I 

T 
1 

LoadB 

I 

T 
i 

LoadI 

I 

I 

Store 

0 

I 

Read 

I 

0 

Route 

half 

I 

I 

half 

0 

NOP 

X 

RUG 

I  or 

0 

I 

Memory  is  split,  depending  on  odd  bit  in 
Instruct  ion. 


Memory  is  I  if  Read  bit  is  disabled 
(write  the  rug).    Memory  is  0  if  Read 
is  enabled  (read  the  rug) 


bit 


Cube  Pins 

If  sendr  active  (unclocked):  output 

Else  input 


Pi  nouts : 
l# 

(defconst  cm : *ch i p-p i nouts*  '( 


;  Name 


Pin  # 


Type 


Top  side  start  i ng  at 


-pi 
-pi 
-pi 
-pi 
-pi 
-pi 
-pi 
-pi 
vss 
vdd 
-pi 
-pi 
-pi 
-pi 
-pi 
-pi 
-pi 


;  Right  side 
( i-pi  ns[16] 
( ECC0 
(M-p  i  ns 
(M-p  i  ns 
(M-pi  ns 
(M-p  i  ns 
(M-p i  ns 
(vdd 
(vss 
(M-pi  ns 
(M-pi  ns 
(M-p  i  ns 
(cube-p  i  ns 
r cube-pi  ns 
(cube-pi  ns 
f cube-p  i  ns 
(cube— pi  ns 


10] 


Bottom  side 
fcube-pi  ns[6] 
ECC2 
ECC1 
ECC3 
ECC4 

(Error-pi  n 
I/O 
'vdd 
[vss 


9 
8 
7 
6 
5 
4 
3 
2 
1 

68 
67 
66 
65 
64 
63 
62 
61 

start  i  ng 

60 

59 

58 

57 

56 

55 

54 

53 

52 

51 

50 

49 
1  48 
J  47 

46 

45 

44 

start  i  ng 
43 
42 
41 
40 
39 
38 
37 
36 
35 


left 
bid 
bid 
bid 
bid 
bid 
bid 
bid 
bid 
vss 
vdd 
bid 
bid 
bid 
bid 
bid 
bid 
bid 


I ook  i  ng 
i  rect 
i  rect 
i  rect 
i  rect 
i  rect 
i  rect 
i  rect 
i  rect 


i  rect 
i  rect 
i  rect 
i  rect 
i  rect 
i  rect 
i  rect 


Alpha  Version  Names 
from  component  side. 


at 


top. 

bidi 

bidi 

bidi 

bidi 

bidi 

bidi 

bidi 

vdd 

vss 

bidi 

bidi 

bidi 

bidi 

bidi 

bidi 

bidi 

bidi 


rect 
rect 
rect 
rect 
rect 
rect 
rect 


rect 
rect 
rect 
rect 
rect 
rect 
rect 
rect 


at  right, 
bidi  rect 
bidi  rect 
bidi  rect 
bidi  rect 
bidi  rect 
output 
bidi  rect 
vdd 
vss 


i-p  i  ns 

1' 

i— p  i  ns 

'2' 

i-pi  ns 

'3' 

i— p  i  ns 

4' 

i-p  i  ns 

5' 

: 

i-p  i  ns 

6' 

) 

i— pi  ns 

7' 

i 

i— p  i  ns 

'8' 

vss) 

vdd) 

i— p  i  ns 

[9]) 

i-p  i  ns 

10 

i-pi  ns 

11 

i-p  i  ns 

12 

i— p I ns 

"13' 

i-pi  ns 

°14' 

i— p  i  ns 

15' 

i-pi  ns 

'16 

M-p  i  ns 

'16 

M-p  i  ns 

'15 

M-p  i  ns 

'14 

M-p  i  ns 

'13 

M-p  i  ns 

12 

M-pi  ns 

11 

vdd) 

vss) 

M-pi  ns 

[101) 

M-p  i  ns 

1 

M-pi  ns 

cube-pi  ns 

11 

cube-pi  ns 

10 

cube-pi  ns 

9' 

cube-pi  ns 

8' 

cube-pi  ns 

7' 

cube-p  i  nsf 6] ) 

WS-pin) 

E-Pin) 

N-Pin) 

LED-pin) 

Er  ror-p  i  n) 

G I oba 1-pin) 

vdd) 

vss) 
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c 1 ock-p  i  n 

34 

i  nput 

ECC5 

33 

b  i  d  i  rect 

Led/CS 

32 

b i d i  rect 

sendr-pi  n 

31 

i  nput 

1 atchr- 

sin  30 

i  nput 

OP-Pi  ns 

2 

29 

i  nput 

OP-Pi  ns 

'1 ' 

28 

i  nput 

OP-Pi  ns 

'e' 

27 

i  nput 

c I ock-pi  n) 
n) 

:l 

>i  n) 
"2] 

1' 
0' 


IN-pi 
cs  I  ow 
send  r 


I  a  t  c  h  r— 
OP-Pi ns 
OP-Pi ns 
OP-Pi  ns 


Left  side  starting  at  bottom. 


i  ns 

0 

i  ns 

'1 

i  ns 

'2 

i  ns 

'3 

i  ns 

'4 

i  ns 

'5 

[?] 

26 
25 
24 
23 
22 
21 
20 
19 
18 
17 
16 
15 
14 
13 
12 
1 1 
10 


b  i  d  i  rect 
b  i  d  i  rect 
bidi  rect 
bidi  rect 
bidi  rect 
bidi  rect 
bidi  rect 
bidi  rect 
vss 
vdd 

bidi  rect 
bidi  rect 
bidi  rect 
bidi  rect 
bidi  rect 
bidi  rect 
bidi  rect 


cube-pi  ns 
cube-pi  ns 
cube-pi  ns 
cube-pi  ns 
cube-pi  ns 
cube-pi  ns 
M-pi  ns[0l ) 
M-p  i  ns[ 1  J ) 
vss) 
vdd) 
M-p  i  ns 
M-p  i  ns 
M-p  i  ns 
M-p  i  ns 
M-p  i  ns 
M-p  i  ns 
i-p  i  ns 
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ALL  CYCLES 


|  T-c-l- 


CLOCK 


■I- 


-T-c-h- 


■I 


OP-PINS 


I  T-op-su- 


|-T-op-h-| 


I-PINS 


ERROR 


_%%%%% 


|— T-i-su— | 


-T-e— ac- 


~  ^So/c/ a/as a/a/ a/a/a/ a/ &/c/o/o_, 


|-T-i-h-| 


|-T-e-h-| 


_%%%%%%% 


READ  CYCLE 
CLOCK      _  I 


I-PINS 


-T-i-ac- 


i-T-i-z-| 


-T-mi-ac  1 


I T- i  r-h- 1 
 T-i-z- 


-T-mi-h—  | 


M-PINS 


LED/— CS 


-T-cs-ac- 


-T-cs-ac- 


LOAD  or  RUG  CYCLE 
CLOCK      ~~ I  


M-PINS 

I/O-PIN 

LED/-CS 


|  T-m-s  u- 


|  T-i  n-su- 


-<%%%%%%%%%%%%%% 


•  | T- 1 m-h- 1 


_%%%%%%% 


- 1  T—  i  n— h—  | 


-T-cs-su  1  T-cs-h- 1 
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STORE  CYCLE 


CLOCK 
I-PINS 


M-PINS 


I/O  PIN 


-T-m-acf-(f rom  falling  edge  of  last  clock)- 
I  T-m-ac  i  


-T-m-acr- 


|  T-sm-z-| 


-T-o-ac  1 


-T-o-z-l 


T-sm-h-| 
I  T-m-z- 


|T-so-h-| 
 T-o-z- 


LED 


-T-l-z- 


-T-l-ac  1 


-<%2 


T-s I -h- | 
 T-l-z- 


ROUTE  CYCLE 


CLOCK 

LATCHR 

(in) 

CUBE 

(out) 
SENDR 

I/O 


_%%%%%%%%%>— 


-T-l  r-su- 


-T-l  r-h- 


|-T-c i-su — l-T-c  i-h- 


 T-co-ac- 

-T-co-z-l 


-T-co-h-| 
— T-co-z- 


-T-g-ac  —  | 


|-T-g-ac-| 


-<35%%%%%JS%%%%%%% 
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*  TIMING  TABLE  FOR  THE  CMCHIP 


NAME 

CYCLE 

PIN 

DESCRIPTION 

NSEC. 

T-c-l 

CLOCK 

CLOCK  LOW 

50 

MIN 

T-c-h 

CLOCK 

CLOCK  HIGH 

50 

MIN 

T-op-su 

OP-PINS 

SETUP  TIME 

-2 

MIN 

T-op-h 

OP-PINS 

HOLD  TIME 

20 

MIN 

T-i-su 

INSTRUCTION  PINS 

SETUP  TIME 

-2 

MIN 

T-i-h 

INSTRUCTION  PINS 

HOLD  TIME 

20 

MIN 

T-e-ac 

ERROR  PIN 

ACCESS  TIME 

30 

MAX 

T-e-h 

ERROR  PIN 

HOLD  TIME 

8 

MIN 

T-m-su 

LOAD 

MEMORY  PINS 

SETUP  TIME 

-2 

MIN 

T-m-h 

LOAD 

MEMORY  PINS 

HOLD  TIME 

20 

MIN 

T— cs-su 

LOAD 

SC  PIN 

SETUP  TIME 

-2 

MIN 

T-cs-h 

LOAD 

SC  PIN 

HOLD  TIME 

20 

MIN 

T-i  n-su 

LOAD 

INPUT  PIN 

SETUP  TIME 

-2 

MIN 

T-in-h 

LOAD 

INPUT  PIN 

HOLD  TIME 

20 

MIN 

T-i-z 

READ 

INSTRUCTION  PINS 

TRI-STATE  TIME 

8 

MIN 

T-i-z 

READ 

INSTRUCTION  PINS 

TRI-STATE  TIME 

30 

MAX 

T-i-ac 

READ 

INSTRUCTION  PINS 

ACCESS  TIME 

30 

MAX 

T-i  r-h 

READ 

INSTRUCTION  PINS 

HOLD  TIME 

8 

MIN 

T-m  i  — h 

READ 

MEMORY  PINS 

HOLD  TIME 

8 

MIN 

T-mi— ac 

READ 

MEMORY  PINS 

ACCESS  TIME  (ecc  mode) 

60 

MAX 

T-mi-ac 

READ 

MEMORY  DATA  PINS 

ACCESS  TIME  (parity  mode) 

20 

MAX 

T-mi-ac 

READ 

MEMORY  PARITY  PIN 

ACCESS  TIME  (parity  mode) 

46 

MAX 

T— cs-ac 

READ 

LED-CS  PIN 

ACCESS  TIME  (fal 1 ) 

15 

MAX 

T-cs-ac 

READ 

LED-CS  PIN 

ACCESS  TIME  (rise) 

8 

MIN 
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T— m— acf 

STOKt 

MtMOKT  Hi No 

AUUtbo   rKUM   rALL      ^©CC  fTlOCie^ 

1  AC* 

MA  V 
MAA 

T-m— acf 

S  I  OK  b. 

RATA    D  T  kIC 

MLMUKY  UAIA  riNi 

AUUtbb  rKUM  rALL     ^parity  roociey 

IjIA  V 
MAX 

T— m— acf 

b  1  UKt 

tAClAC\DV    DADTTV  DTK) 

MtMUKl   rAKlIT  r  IN 

AUULbb   rKUM  rALL      ^parity  moaej 

MAY 

MAa 

T— tti— ac  r 

b  1 UKt 

MLMUKY  rlNb 

A^PCQ    rnnu  Dior 
AUL-C-oo   rKUM  KlbL 

A^ 

UA  V 

MAa 

T— m— ac  i 

3  1  UKt 

iiCiiODV    D  T  MO 

Mt-MUKT  rlNb 

aouljo  r KUwi  iNoi  .   ^ecc  moae ) 

7D1 

MAY 
MAa 

T— in— ac  i 

STORE 

UCIJ^DV    D  T  kIC 

MtMOKT  rlNb 

AL-ULbb  rKUM  lNoi.   ^parity  ntoae ^ 

/O 

MAa 

T— g— su 

LOAD 

1  AD  A  1 

GLOBAL 

C  UT\  ID    T  T  kJC" 

be. 1 Ur    1  1Mb 

Ml  IN 

T-g— h 

LOAP 

i  An  »  i 

(jLUdAL 

uai  r»  TT1JC" 

HULL)    i  1Mb 

Ml  JN 

T— g— ac 

blOKt 

OLUdAL 

AUL-tbb    1  IML 

M/vo 

T— g— h 

STORE 

GLOBAL 

HULL)    1  1Mb 

kit  T  KI 
M 1  N 

T-g-z 

STORE 

GLOBAL 

TD  T     CT  A  T  C    T  T  kiT 

1 K I—b 1  A  1  t    1  IML 

MT  M 
Mir* 

T-g-r 

STORE 

bLOBAL 

KtLLAbL    1  IML 

MA/c 

T-t-ac 

STORE 

LED 

ACCESS  TIME 

MA% 

T-l-h 

STORE 

LED 

HOLD  TIME 

MIN 

T-l-z 

STORE 

LED 

TRI-STATE  TIME 

MIN 

T-l-r 

STORE 

LED 

RELEASE  TIME 

MA% 
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NOTE:    BETA  EUNUCH  simulated  times  can  be  found  in: 
"a:>cm>beta>t  imi  ng>eunuch-t imes. text "  . 

Quest  ions: 

25.  add  I/O  sequence  to  sequences  section 

29.  off  chip  wires  not  protected  by  parity: 
OP 

News 
g I oba I 
error 
I/O 
msgp 

Direct  cube  writing 
Sendr  latchr 

30.  Direct  cube  write  sequence  in  sequences  section  looks  wrong. 


end 
l# 


